首页> 外文OA文献 >Deep Speaker Feature Learning for Text-independent Speaker Verification
【2h】

Deep Speaker Feature Learning for Text-independent Speaker Verification

机译:深度扬声器功能学习,用于独立于文本的扬声器验证

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Recently deep neural networks (DNNs) have been used to learn speakerfeatures. However, the quality of the learned features is not sufficientlygood, so a complex back-end model, either neural or probabilistic, has to beused to address the residual uncertainty when applied to speaker verification,just as with raw features. This paper presents a convolutional time-delay deepneural network structure (CT-DNN) for speaker feature learning. Ourexperimental results on the Fisher database demonstrated that this CT-DNN canproduce high-quality speaker features: even with a single feature (0.3 secondsincluding the context), the EER can be as low as 7.68%. This effectivelyconfirmed that the speaker trait is largely a deterministic short-time propertyrather than a long-time distributional pattern, and therefore can be extractedfrom just dozens of frames.
机译:最近,深度神经网络(DNN)已用于学习说话者特征。但是,学习到的特征的质量不够好,因此,像原始特征一样,当应用于说话人验证时,必须使用复杂的后端模型(神经或概率模型)来解决剩余的不确定性。本文提出了一种用于说话人特征学习的卷积时延深度神经网络结构(CT-DNN)。我们在Fisher数据库上的实验结果表明,这种CT-DNN可以产生高质量的说话者特征:即使具有单个特征(包括上下文在内,为0.3秒),EER仍可低至7.68%。这有效地证实了说话人特征在很大程度上是确定性的短时属性,而不是长时间的分配模式,因此可以从几十个帧中提取出来。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号